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Abstract 

We first generalise ideas discussed by Kiss et al. (2015) to prove a theorem for 
generating exact closures (here expressing joint probabilities in terms of their 
constituent marginal probabilities) for susceptible-infectious-removed (SIR) dy¬ 
namics on arbitrary graphs (networks). For Poisson transmission and removal 
processes, this enables us to obtain a systematic reduction in the number of 
differential equations needed for an exact ‘moment closure’ representation of 
the underlying stochastic model. We define ‘transmission blocks’ as a possible 
extension of the block concept in graph theory and show that the order at which 
the exact moment closure representation is curtailed is the size of the largest 
transmission block. More generally, approximate closures of the hierarchy of 
moment equations for these dynamics are typically defined for the first and sec¬ 
ond order yielding mean-field and pairwise models respectively. It is frequently 
implied that, in principle, closed models can be written down at arbitrary order 
if only we had the time and patience to do this. However, for epidemic dy¬ 
namics on networks, these higher-order models have not been defined explicitly. 
Here we unambiguously define hierarchies of approximate closed models that 
can utilise subsystem states of any order, and show how well-known models are 
special cases of these hierarchies. 

1 Introduction 

A primary method for incorporating spatial structure and other contact struc¬ 
tures into epidemic models is to use a network of contacts [Tj. While simulation 
of stochastic models is straightforward on these networks, obtaining differen¬ 
tial equation descriptions of the relevant time series is more complex. Here we 
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consider the construction of a hierarchy of moment equations which, in statisti¬ 
cal physics, is sometimes known as the Bogoliubov-Born-Green-Kirkwood-Yvon 
(BBGKY) hierarchy after the names of its originators. The method was ap¬ 
plied to population-level network-based epidemic and ecological models in the 
1990s where truncation of the hierarchy was made at second order yielding pair- 
approximation models mmmm- Higher-order truncation of this hierarchy at 
the level of triples has also been investigated mica. With increasing compu¬ 
tational resources it has also become numerically viable to consider these hier¬ 
archies in terms of individuals, rather than population-level quantities 00 hd]. 
A particularly important feature of the individual-level representation is that it 
enables us to establish exactness for finite populations in certain circumstances 
(see m and also [IT and m by different methods). 

Here we generalise ideas discussed by Kiss et al. Qj] and also noted by New¬ 
man DU, and apply them to arbitrary directed networks. We also observe that 
they apply to non-Markovian as well as Markovian SIR dynamics. Depending 
on the network, we find that for Markovian dynamics, exact closed models exist 
at all levels of the hierarchy of moment equations. The exact models and exact 
closures considered in m and [12] then represent special cases. 

While the majority of moment closure models do not go beyond closure at 
the level of pairs (second order), it is frequently stated that, in principle, closed 
models at any order can be constructed. However, such higher-order models 
are rarely defined explicitly. Here, in the Markovian SIR epidemic context, we 
shall define hierarchies of closed models that can be constructed unambiguously 
at all orders by a systematic truncation method. In fact, we shall define and 
investigate several hierarchies of approximate models. All of these converge 
to exact representations at truncation orders which depend on the underlying 
network structure and all of them have either the pair-level model discussed 
in ■£] and [9] or the variant of this model discussed in m as the lowest (zeroth) 
order level of truncation. 

The next section discusses the relevant background concepts upon which our 
work builds. Section [3] introduces the exact closure theorem which defines the 
conditions under which simplifications to the hierarchy of equations can be made 
for particular networks. Section [I] introduces approximate closures leading to 
complete hierarchies of approximate models. 


2 Background concepts 

Apart from Theorem 13.11 which applies more generally, we shall consider a 
Markovian class of SIR models on contact networks. In particular, we con¬ 
sider a directed graph D = (V,A) consisting of IV = |Vj individuals/nodes and 
a set A of arcs. We also label each individual according to some arbitrary order¬ 
ing such that if i £ V then i £ {1,2,...,7V}. Each individual can be in only one 
of three states S, /, or R at any given time. Node j £ V, when infectious, makes 
‘infectious contacts’ to node i £V via a Poisson process of rate Rj > 0, where 
Tij > 0 (j, i) £ A and where we assume that T it = 0 for all i £ V. If node i is 
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susceptible when it receives an infectious contact then it immediately becomes 
infectious. It will then remain infectious for an exponentially distributed period, 
with parameter 7 *, after which it becomes recovered which is an absorbing state 
for the individual. We thus have a continuous-time Markov chain with a state 
space of size 3 ,v . Except where otherwise stated, we also assume initial con¬ 
ditions such that the states of all nodes are initially statistically independent. 
This assumption encompasses all pure-state initial conditions, such as a specific 
individual being infectious with all others susceptible, and it also incorporates 
binomially distributed initial conditions. Uniform initial conditions can also be 
exactly represented with additional computation ISl- 

Definition 2.1. Si, li and Ri denote the indicator random variables for the 
events that node i G V is susceptible, infectious and removed respectively. De¬ 
pending on the context, it will also be convenient to refer to Si, /, and Ri as the 
corresponding events themselves. 

The hierarchy comprises of a sequence of equations containing the first mo¬ 
ments and mixed moments of the random variables Si and Using angle 
brackets to denote expectation values, it can be shown [5] that the master equa¬ 
tion (or Kolmogorov forward equations) implies the following rate equations: 

N 

(Si) = -Y^TiASJi), 

3 =1 
N 

(ii) = (i) 

i=i 

where Silj is a product of the indicator random variables which also specifies a 
state of the subsystem of order two comprising of the pair of nodes i and j. For 
this pair state we have: 


N N 

(s'ilj) = ^TjkilSiSljh) -^^( 5 ^- 4 ) 

k^i k^j 

~ Tij (Silj) — 7 ? (Silj). ( 2 ) 

More generally, for these models, the master equation allows us to write 
down a rate equation for the probability of any subsystem state of size n in 
terms of subsystem states of size n and subsystem states of size n + 1. We state 
this as a theorem below (‘Theorem 12 . II) . 

Following prior work m , but with a notational simplification brought about 
by using the same index for all system and subsystem states, we define an 
alternative notation to Definition O that is useful for keeping track of the 
hierarchy of moment equations in this context. 

Definition 2.2. We use the following notation to denote subsystem states. 

• ipw is a subsystem comprising of the set of nodes W C V. 
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• Let A be a mapping from the elements of W to {5, I, i?} ; and let Ai be the 
image of node i £ W under A. Thus, A can also be interpreted as a pure 
state for subsystem tfw, he. the state where, for all i £ W, individual i is 
in state A j. 

• ipyf denotes the indicator random variable for the event that tfw is in state 
A. Thus the probability of the event that subsystem ifw Is in state A is 
P{ifw = A) = (ifw)- As in Definition \2.1l it is also convenient to refer 
to ipfyr as the event that ipw is in state A. 


Remark. For the event where node i is in a susceptible state, we can draw the 
following correspondence between the notations: ipf = Si, and similarly for the 
infectious and removed states. 

Definition 2.3. Let k £ W C V and X £ {S,I,R} and let A be a state of 
subsystem ifw- Then, h' k denotes the indicator random variable or event 

ifw’ but where the state of node k is changed to state X leaving the states of all 
other nodes unchanged. Note that if At, = X then h k (ifw) = V’ w■ 

Theorem 2.1. For any subsystem tpw> the probability that it is in state A is 
governed by the rate equation: 


it 


w / 


= Y 1 (A k = S) 
kew 

+ ^2 = -0 

kew 

+ 'Y2 = -0 

kew 

+ Y, H A k = R) 


- Y T kn l(A n = - Y TknWwIn) 

nev\w 


new 


Y Tknl{A n = I)(h s k ^w))-lk(i’w) 


new 


Y T kn {h s k {i^)I n ) 

nev\w 


kew 


Tk{h{{ipw)) 


( 3 ) 


where here, and throughout this paper, the indicator 1(.) is equal to 1 if its 
argument is true and is equal to zero otherwise. 

This theorem is proved in m- Starting with subsystem states that are 
only composed of susceptible or infectious individuals, repeated application of 
equation [3] to each of these states as well as to any subsystem states that arise 
on its right-hand side can never result in subsystem states with a removed 
individual. This is due to the absence of h k in equation [3] Hence, for these 
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subsystem states, t(Ak = R) = 0 for all k £ W so equation [3] becomes: 



E = S) 

kew 

+ E = ^ 

k&W 

+ E = 


-EM^ = J)«>- E T ^wln) 

n£_V\W 


new 


E T fc „l(A„ = I)(h s k (^)) - 7 fc(V>w) 


neW 


kew 


E T kn{h%(K)I n ) 

nev\w 


( 4 ) 


Equations [I] and [2] can now be seen to be special cases of this theorem. 

By applying equation [4] to every individual in the network for states S and 
I and then reapplying to every new subsystem state which emerges, we obtain 
a closed set of differential equations for a set M of subsystem states. However, 
\M\ will generally be very large for most systems, preventing numerical solution. 

To reduce the number of equations, we need to introduce a mechanism to 
curtail the generation of new subsystem states. In the next section, we discuss 
scenarios in which this can be done where the emerging system is still an exact 
representation of the underlying stochastic process. Following this, we shall 
consider hierarchies of approximate closed models. 


3 Exact closed models 

Here we prove a theorem pertaining to arbitrary SIR dynamics on arbitrary 
networks. We then use this to derive a class of exact models for Markovian 
SIR dynamics on arbitrary networks. We illustrate this with some examples, 
and finally state a theorem specifying the maximum size of subsystem needed 
to exactly represent the dynamics on any given network. 

3.1 Exact closure theorem 

For a given directed graph D = (V,A) with set V of nodes/individuals and set 
A of arcs, we make the following definitions: 

Definition 3.1. IN(X) is the set of individuals that can reach at least one 
member of X CV by following a directed path. Note that X C IN(X). 

Definition 3.2. Let X,Y,Z C V be disjoint and non-empty. The set of nodes 
Z is ‘dynamically partitioning ’ with respect to X and Y if and only if we have 
fE{X,Y,Z ) = 1 where: 

MV-2,4 1 2/VPO n IN(Y) = 0 (in D-Z) 

y 0 otherwise 

and D — Z is the vertex-set deleted subgraph consisting of nodes V \ Z. Here, 
E is chosen to represent ‘exact’; this is appropriate since we shall now see that 
}e{X,Y,Z) = 1 implies the existence of an exact closure relation. 
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Remark. If the network is undirected then /e(X, Y. Z) = 1 if and only if there 
is no path between X and Y in D — Z. 


Theorem 3.1. We consider stochastic SIR dynamics defined on a time-invariant 
network where the initial conditions are such that the states of individual nodes 
are initially statistically independent. Let V’jc) V’y ant ^ ' t l ) z ^ e indicator random 
variables or events where X, Y, Z C V are disjoint and nonempty. If Z is dy¬ 
namically partitioning with respect to X and Y, and all nodes in subsystem state 
C are susceptible ( Ci = S \/i £ Z), then provided that (ifz) 7 ^ 0, 


(ifx^Y^z) = 


Wz) 


( 6 ) 


Proof. If the infection has not passed through Z (which is guaranteed by all 
nodes in state C being susceptible), the states of the individuals in X are sta¬ 
tistically independent of the states of the individuals in Y. This is true since 
/e(X, Y, Z) = 1 implies that there are no individuals from which both a mem¬ 
ber of X and a member of Y can be reached without traversing a member of Z. 
We have: 


P{^xA’y^z\^z) = P{^x,^z\^z) p ^Yii>z\i>z)- 
Given that P(ifz) ^ 0, we have: 

Pj^X^Y^z) = P(^X^z) P(j’Y^z) 

P(lfC) P(lfC) PtyC) ’ 

from which the result follows. □ 

Remark. For the case of zero denominator, note that P(ifz) = 0 implies that 
P^x^y^z) = 0 - 

Notice that we made no assumptions about the SIR dynamics in proving this 
theorem and that it is therefore not restricted to Markovian systems, although 
it is the Markovian case that we shall be applying it to in the remainder of this 
paper. 

The theorem is a generalisation of the main result in |12| which is stated 
in terms of single dynamically partitioning individuals on undirected networks. 
In that context they are referred to simply as partitioning individuals due to 
their correspondence to graph partitioning. Some examples of where the ex¬ 
act closure theorem can be applied are shown in Figure |T] In this Figure and 
throughout the remainder of the paper, network links without arrowheads de¬ 
note undirected links whereas those with arrowheads denote directed links. 
Figure [lji is typical of the dynamical partitioning we shall consider in this 
paper. Applying Theorem 13.11 we see that there is dynamical partitioning 
about node 2, so we have ( I 1 S 2 S 3 I 4 S 5 ) = F° r Fig¬ 

ure mo we can dynamically partition about a cluster of susceptible nodes. In 
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Figure 1: Three examples of network states where the location of the susceptible 
nodes allows the application of the exact closure theorem. Here directed links 
have arrowheads and undirected links do not. 


fact there are two exact closures we can write down: (IiI 2 S 3 S 4 S 5 SeI 7 IsS 9 ) = 
(hI 2 S 3 S 4 S 6 )(S 3 S 4 S 5 S 6 I 7 I s S 9 )/(S 3 S 4 S 6 ) = (hl 2 S 3 S 4 S 5 S 6 )(S 3 S 4 S 5 I 7 I 8 S 9 )/(S 3 S 4 S 5 ). 
In Figure QJ: we can apply the exact closure theorem to obtain (SiI 2 S 4 S 5 lsI 7 ) = 
{Sil 2 S 4 )(S 4 ,S 3 IeI 7 )/(S 4 ). Note that I 3 is not included in this closure. 

For our purposes, we are interested in a special case of the exact closure 
theorem which is captured by the following corollary. 

Corollary 3.1. For subsystem state A of ipw, if Ak = S where k G W, and if 
fE{n , W \ k, k) = 1 where n G V \ W, then 




(4’w)(SkIn) 

(Sk) 


(7) 


This corollary is illustrated by the example 


in Figure [lji. By applying this 
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to equation 2] we obtain: 


(K) 


k£W 


E = 5 ) 
E ^ = 7 ) 


— E ^fen^C^n — I){lpw) 

new 

E r fen l(A„ = I)(h%Ww)) -7fe(^) 


fceiv 


neW 


- e i(^fc = 5) E 

n£V\W 

(h S k ('K))(SkIn) 


kew 

+f E (n,W\k,k) 


(l -fE{n,W\k,k)){hi(il4r)I n ) 


(Sk) 


E M-A-k = I) E ^kn 

kew nev\w 

(hki^W^iSkln) 


(l -f E (n,W\k,k))(h s k (K)In) 


+/e(^j W \k,k)- 


(Sk) 


( 8 ) 


For an arbitrary network, by applying equation [8] to the indicator random 
variables Si and Ii for all * G {1,2,..., N}, and then reapplying it to every new 
subsystem state that emerges, a closed set of differential equations for the exact 
time-evolution of the probability of an individual being in a particular state is 
obtained for all individuals. The number of equations that will be needed is 
limited by the closures that are made possible by the exact closure theorem. 

Definition 3.3. For a given network, the induced set Me of subsystem states 
is obtained by applying equation^ to every individual (for states S and I)in the 
network, and then reapplying to every new subsystem state that emerges. Me 
is then the full set of subsystem states that emerge during this process. 

Remark. It follows that Si and R (Vi G V) and SJj (fJi,j G V : Tij > 0) 
represent members of Me for any network. 


3.2 Examples 

Before determining the network structures under which dynamical partitioning 
occurs more generally, we consider some examples. For further examples in the 
context of undirected networks the reader is directed to [T2] . 


3.2.1 Example 1 

Definition 3.4. A network is a tree network if and only if its underlying graph 
(all directed edges are replaced by undirected edges) is a tree or forest. 












a) 


c) 



b) 




Figure 2: Some example graphs. For dynamics on these graphs, we assume a 
generic removal rate g and a transmission rate of 1 across all links. 


Theorem 3.2. For Markovian SIR dynamics on a tree network where the states 
of all individuals are initially statistically independent , the following equations 
hold exactly: 


(Si) 


(ii) 


< shj ) 


(SiSj) 


N 


(^ilj ) j 


3 = 1 


N 




3 =1 

\ N ' ^ (SiSj)(SjI k ) ^ ,. {SihXSJj) 

Z^ 1 ^ k / c ,\ Z^ ±ik / c.\ ’ 

fc=l,fc#i WJ' fe=l,fe^J 

—Tij (S'i/j} — 7j (S'i/j}, 

\' rp (SiSj){SiIk) t ( SiSj)(SjI k ) 

Z- (5.) z^ 


k=l,kj^j 


k=l,k^i 


(S 3 ) 


( 9 ) 


Proof. For such tree networks, every individual is dynamically partitioning rel¬ 
ative to any two of its neighbours on the underlying graph. Hence, the above 
system follows directly from repeated application of equation 0 starting with 

(Si), (Ii) Vi € V. □ 


Remark. This is the pairwise model that was shown to be exact for tree networks 

in HU. 


3.2.2 Example 2 

Consider the graph in Figure Or. Let us suppose that all nodes have the same 
removal rate g and that the transmission rate across all network links is unity. 
For simplicity we shall also make this assumption through the remainder of the 
explicit examples in this paper. We can apply Corollary 13.II which is embedded 
in equation [5] to build up the induced subsystem states Me- Let us just consider 
the infectious probability of node 1 to see how this works. We have: 

(h) = (S 1 I 2 ) - g(h). 
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Here and throughout the paper we order nodes according to the numerical order 
of their labels; the relevant motif structures need to be understood with reference 
to the associated graph. Now, node 2 is dynamically partitioning with respect 
to nodes f and 3, and it is also dynamically partitioning with respect to nodes 
1 and 4. Hence: 


(Sih) 


{■SiS 2 I 3 ) + (SxS 2 h) ^ (Sxh) - g{S 1 I 2 ) 


(S 1 S 2 )(S 2 I 3 ) (SxSJ^h) 
(S 2 ) + (S 2 ) 


(1 +g)(S 1 I 2 ). 


Rather than a complete analysis of all induced subsystem states that arise, we 
take the single pair state S 2 I 3 from this equation as an example. Here, node 
3 is not dynamically partitioning with respect to nodes 2 and 4 but node 2 is 
dynamically partitioning with respect to 1 and 3 so: 

(S2I3) = (S 2 S 3 h) - - < hS 2 I 3 ) - (1 + g)(S 2 I 3 ). 

W2/ 


Then for (S 2 S 3 l 4 ), node 2 is dynamically partitioning with respect to node 1 
and nodes 3 and 4 so: 

(S 2 S 3 h) = - (/l52)(5 f 3/4) - (2 + g)(S 2 S 3 h). 

W2) 

We see that here, Me represents a significant dimensional reduction in the num¬ 
ber of induced subsystem states compared to the full set of induced subsystem 
states M. 


3.2.3 Example 3 

For the undirected graph in Figure there is dynamical partitioning about 
node 1. Starting with (for example) the infectious probability for node 1, we 
have: 


(A) = <Si/ 2 ) + <Si/ 4 ) + (Sih) + (Sxh) - g(h ), 


where again we are assuming transmission rates of unity and a removal rate g 
for each node. Now, choosing the first of these pairs to develop one part of the 
induced set Me gives: 


(Sl'/ 2 > 


(SiS 2 I 3 ) - (Sxhh) - (5l/ ^ ( f /5 

Wl) 

-(1 +g)(S 1 I 2 ), 


(5i/ 2 )(5i/ 6 ) 

(Si) 


( 10 ) 


and then for the first of these triples: 


(Si5 2 / 3 ) 


too O T \ /C C T T \ {SlS2h){Slh) 

^01020324; — \Olb 2 l 3 l<x) -——r- 

(bi) 

-{l + g){SiS 2 I 3 ). 


(SxSM^iSxh) 

(Si) 
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For the first of these quads we have: 


(S'! 5253/4) 


(5,5253/4) (5, / 5 ) (5,5253/4) (5,/e) 

(Si) (Si) 

-(2 + g)(S 1 S 2 S 3 I 4 ). 


Here, the maximum size of a subsystem state is four. We note that this is equal 
to the size of the largest simple cycle and that this was also true for example 2. 
However, this is not always the case as shown by the next example. 


3.2.4 Example 4 

Figure [2J: shows a network where the maximum simple cycle size is 4 but the 
maximum size of a subsystem state in Me is 5. Starting with the infectious 
probability of node 1 we have: 

(/,) = (5,/ 2 ) + (Sih) + (Sil 5 ) - g(h). 

Then, taking just the subsystem state in the first term: 

<5i7 2 ) = (5,5 2 / 3 ) - (5i/ 2 /4) - (5,J 2 /5) - (1 +s)(5i/ 2 ), 

and again taking just the first term: 

(5,52/3) = (5i5 2 5 3 /4) + (5,5 2 5 3 / 5 ) - ( 5 , 53 / 3 / 4 ) - ( 5 , 52 / 3 / 5 ) 

-(l + <7)(5,5 2 / 3 ). (11) 

Finally, taking the first term again gives: 

(5,525 3 / 4 ) = -2(SiS 2 S 3 hI 5 ) - (2 + g)(S 1 S 2 S 3 I 4 ). (12) 

In this case we see that the maximum size of a subsystem state is at the size 
of the system (5 nodes) and is not constrained by the largest simple cycle (4 
nodes). This leads to the question: What aspect of a network specifies the 
largest subsystem size that appears in Me^ We answer this question in the 
following subsection. 


3.3 System size 

Here we define the type of network structures that are amenable to dynami¬ 
cal partitioning. We start from single node subsystems and expand out, via 
equation [51 until the largest subsystem is reached incorporating that individ¬ 
ual before dynamical partitioning prevents larger subsystems emerging. For the 
undirected case, the situation simplifies considerably [12] since all dynamically 
partitioning individuals are also cut-vertices (individuals which, when removed, 
increase the number of connected components). It is then helpful to represent 
the network as a collection of blocks (maximal biconnected subgraphs) where the 
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between-block structure is tree-like (see Figure 0Jr)- This makes it straightfor¬ 
ward to assess the feasibility of constructing a solvable exact system by making 
use of dynamical partitioning. Notice that it is possible for a node to belong 
to more than one block as in the top right of Figure 0b although the overlap 
between any two blocks can only be a single node. It is interesting that this 
representation of the network resembles the household model structure where 
analytic progress can also be made m- For directed networks, the situation is 
more complicated. Here we define ‘transmission blocks’ to play a similar role 
to blocks. Indeed, blocks and transmission blocks will have equivalent defini¬ 
tions in the undirected case. We use the term transmission block rather than 
block since there are likely to be other useful extensions of the block concept 
for directed networks. 

Definition 3.5. Let D = {V,A) be a directed graph with set V of nodes and set 
A of arcs. Let W C V. Then D[W] is the subgraph formed from the nodes of 
W and arcs with endpoints both in W. 

Definition 3.6. The subgraph D[W] is a ‘directed sub-block’ if and only if there 
is at least one node reachable from all others in D[W] and its underlying graph 
is biconnected. 

Remark. According to this definition, any block in an undirected network is also 
a directed sub-block. Hence, the blocks illustrated in Figure 0b are all directed 
sub-blocks. 

Definition 3.7. We will refer to a directed sub-block D[W] as a ‘transmission 
block’ if and only if there does not exist U D W such that D[U ] is also a directed 
sub-block. 

The shaded boxes in Figure0]are examples of transmission blocks. Figure [2b 
gives an example of these on a directed graph. Notice that now it is possible 
for transmission blocks to overlap by more than one node (the darker shaded 
triangle belongs to two transmission blocks). This happens when a region of 
the network has paths to two or more other regions that do not have paths 
between each other. Figure 0] shows some more examples of these definitions for 
directed networks. Figured and b have underlying graphs that are biconnected. 
Figure 0b also has a node (node 1) which is reachable from all others and 
so it is a directed sub-block whereas Figure 0b is not. Figure 0b is also a 
transmission block since it is maximal. Additionally, neither have sub-graphs 
of the underlying graphs that are biconnected and so neither contain directed 
sub-blocks as subgraphs. Figure 0b is a transmission block (the underlying 
graph is biconnected and node 2 is reachable from all others). It also contains 
several directed sub-blocks (for example nodes 1,2 and 3). Figure 0Ji contains a 
transmission block as a subgraph (nodes 1,2,3,4) and contains several directed 
sub-blocks. 

We can now state the main result on subsystem size: 

Theorem 3.3. The largest subsystem state in Me consists of the same number 
of individuals as the largest transmission block, or it contains 2 individuals if 
there are no transmission blocks. 
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a) Undirected network 




Figure 3: Examples of networks that decompose into transmission blocks. The 
transmission blocks are highlighted by the shaded rectangles. Darker areas are 
where two transmission blocks overlap, a) An undirected network where the 
effectiveness of dynamical partitioning is made clear by the number of distinct 
biconnected subgraphs which resemble structured households, b) A directed 
network where identifying the transmission blocks is more complicated. 
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Figure 4: Four directed graphs. Graph a) is not a transmission block whereas 
graphs b) and c) are transmission blocks. Graph d) contains a transmission 
block as a subgraph. 


Proof. The Theorem follows from Gorollarv lA.il and Lemma fA.41 (see Appendix 
A). From Corollary IA. 11 the individuals contained in a subsystem state larger 
than a pair appearing in Me belong to some transmission block. From Lemma fA.4l 
any transmission block appears as a subsystem state in Me- □ 


4 Hierarchies of approximate models 

The systems of equations in the previous section are exact, but limited in ap¬ 
plicability because of the limited scope for dynamical partitioning in most net¬ 
works. To suitably curtail the large number of equations, the networks need to 
have a structure which is roughly tree-like. 

More typically, we want to trade off some exactness for models which are 
numerically tractable and provide a good, rather than exact description of the 
underlying dynamics. The pair-level SIR model (equation [9]) is exact for tree 
networks but is also a reasonably good approximation for SIR dynamics on a 
wide range of networks. Higher-order models will typically be more accurate, 
but will have considerably greater computational cost. Here we formally de¬ 
fine hierarchies of approximate models that can be applied to Markovian SIR 
dynamics on any network. 

We define ‘pseudo-partitioning’ according to different criteria. We define two 
hierarchies of models via what we term ‘cycle-partitioning’ and ‘size-partitioning’. 
We then also consider a ‘hybrid-partitioning’ hierarchy utilising both methods. 
Although these pseudo-partitionings can be defined more generally, as in the 
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case of dynamical partitioning itself, we shall restrict our attention here to dy¬ 
namical partitioning with respect to single susceptible nodes. 

Generalising from the case of dynamical partitioning, we define a function 
f p (X, Y, i) to specify some pseudo-partitioning of subsets X and Y with respect 
to node i and enable a systematic curtailing of the number of subsystem states 
necessary for a solvable model. By analogy with equation [8l we have: 


<v^> * E 1(^ = 5) 

keW 

+ E = -o 


- E T kn l(A n = I)(v 4 ) 

new 

E Tk n t(A n = ^7 k{4’w) 


kew 


new 


- E M A k = s) e 

kGW n£V\W 

(h S k (^))(SkIn) 


(l - f P {n,W\k,kj)(h%W>&)I n ) 


+f P (n,W\k,k)- 


(Sk) 


E i (A k = i) E 

nev\w 

(h S k (K))(S k In) 


kew 

+f p (n,W\k,k) 


(l-f p (n,W\k,k))(h s k (K)In) 


(Sk ) 


(13) 


So, when / p (X, Y, *) = 1, we treat i as if it is dynamically partitioning with 
respect to X and Y and so the right-hand-side of the rate equation does not 
generate larger subsystem states. The specific type of approximate model de¬ 
pends on how f p (X, Y. i ) is defined and is formed by assuming equality between 
the left and right hand sides. 

Note that equation[C|] defines a solvable model that is based on the closure in 
equation!?! However, other closures such as the Kirkwood-closure ~ 

{ipfipf ){4>f i’k )(V ; fe , V’i 4 )/((' i /’i 4 )(V’ i f )(' t Pk )) outside of this scheme. It is, how¬ 

ever, straightforward to define a solvable hierarchy of approximate models that 
incorporates the standard Kirkwood closure as a special case. 

Let us denote the adjacency matrix for the underlying graph by U (Uij = 
sgn (Tij +Tji ) for all i,j € V). Then, for the probability of subsystem ifrw being 
in state A , we can approximate: 


(4’w) 


II 

n 

iew 


(14) 


where = Yljew Uij is the number of neighbours of node i in W in the 
underlying graph and is also the number of times that the state of node i 
appears on the numerator. For a fully connected subsystem of three nodes, this 
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is seen to reproduce the standard Kirkwood closure. Using this general idea but 
keeping the state if)yy intact as before, we can write an alternative to equation 0 
For i/jyyi if Ak = S, k GW and n G V \ W and we suppose there is an arc from 
n to k. then we can approximate: 


(t 




(V’u r )(^'fcd n ) 

(Sk) 


n 

jeW\k 


Win) 

Wf*)(In > 


(15) 


Examples of the application of this approximation can be found in section 14.41 
We use this approximation to motivate the following hierarchy: 


W) 


E MAk = S ) 

kew 

+ E = I) 


- ^ T kn t{A n = I)W) 
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E TknA{A n = I)(h k (lpw)) ^7 k{4’w) 


kGW 


new 


- 1 {A k =S) Tkn 

neV\W 


kew 


(l ~ f p (n,W\k,k))(h s k W)In) 


+f P (n,w\k,k) {hSk ^f}\ {Skin) n ^ { ^ ,In) 


(Sk) 


je 


W\k V (V’j 3 )(In) , 


+ E Tkn 

neV\W 


kew 


+f P (n, W\k, k) 


(l - f P (n,W\k,k))(h s k W)I n ) 
(hki’i'w)) (Skin) 


n 


(^In) \ Un 


^ k ^ jeW\k VW’j 3 )(In)j 


■ ( 16 ) 


Either equation 1 131 or [T^l can be used in conjunction with suitable definitions 
of f p (X, Y, i) to generate hierarchies of approximate models. We shall mostly use 
equation mi for explicit examples. However, for completeness, we shall briefly 
discuss equation mi hr section 14.41 

It is worth noting that both of these closures are based around a single IS 
arc being added each time. Other schemes with more complex closures should 
also be possible. For example, Theorem 13.11 allows closures where we do not 
necessarily need to have only singlet states in the denominator (see Figure [TJr)- 


4.1 Cycle-partitioning 

With reference to Figure [5l although node i is not dynamically partitioning 
with respect to W \ i and node j. we might observe that it is in some sense 
‘approximately’ dynamically partitioning because the path length between j 
and W is reasonably long when i is deleted. It seems sensible to define a type 
of pseudo-partitioning according to this path length. 
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Figure 5: An example of a node i £ W which is not dynamically partitioning 
with respect to node j and W \ i, but it is cycle-partitioning up to x = 2. 

Definition 4.1. The set of individuals that can reach at least one member 
of X C V, by traversing a £ IN arcs or less, is denoted IN a (X). Here and 
elsewhere, IN = {0,1, 2,...}. 

Definition 4.2. Node i £ V is ‘cycle-partitioning’ at order x £ N with respect 
to disjoint and non-empty subsets X , Y £ V, where i X U Y , if and only if we 
have fc(x)(X,Y,i) = 1 where: 



1 */ IN a (X) f! INb(Y) = 0 Va, b:a + b = x (in D — i) 


(17) 


0 otherwise 


where a,b £ N. 

We make the following observations: i) If the network is undirected then 
fc{x)(X,Y,i) = 0 if and only if there is at least one path of length x or less 
between some member of X and some member of Y when i is deleted, ii) An 
individual who is dynamically partitioning with respect to two subsets is also 
cycle-partitioning at all orders with respect to those subsets, iii) In Figure [5l 
node i is cycle-partitioning with respect to W \ i and j for x = 0, x = 1, and 
x = 2, but not x > 2. iv) Any individual i £ V is always cycle-partitioning at 
order x = 0 with respect to any other two subsets. 

Adapting Corollary [XU such that cycle-partitioning individuals of order x £ 
{0, 1 , 2 ,.. .} are ‘treated’ as dynamically partitioning individuals, we substitute 
f p (n , W\k, k) = fc(x)( n > W \ k, k ) into equation fldl 

Remark. By applying this rate equation to every individual in the network for 
states S and I and then reapplying to every new subsystem state which emerges, 
we obtain a closed set of differential equations which form the a;th model in a 
hierarchy of approximating models (note that the model corresponding to x = 0 
is the pair-level model given by equation ED- The associated set of induced 
subsystem states will be denoted by M c / X y 
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4.1.1 Examples 

We can consider cycle-partitioning for the network in Figure [ 5 ) 3 . If we cycle- 
partition at x = 1 , then the first two terms of equation [TUlare closed at the level 
of pairs. Specifically, for the first term, node 2 is cycle-partitioning with respect 
to nodes 1 and 3. For the second term, node 1 is cycle-partitioning with respect 
to nodes 2 and 4. This gives: 

• (S 1 S 2 )(S 2 I 3 ) (SiI 2 ){Sih) (5 1 J 2 )(5 1 / 5 ) <S , 1 / 2 )(S 1 J 6 > 

1 1 ~ <S 2 ) <^i> (5i) <5i) 

-(1+ 5 )<SiJ 2 >. 

Thus, triples within the square are no longer ‘kept intact’, and so, within the 
square, the model closes at the level of pairs. However, triples made up of the 
members of the triangle are kept intact. For example, we have: 

(S 5 I 6 ) = (hS 6 S 6 } - (hS 5 I 6 ) - (1 + g)(S 5 I 6 ). 

Figure [ 6 ] shows this hierarchy of models. Here, the x = 0 model is the pair- 
level model (equation [9]). The x = 1 model is an improvement since it picks up 
the triangle. The x = 2 model picks up the square as well and is equivalent to 
the exact closure model (consistent with the master equation). 

If we apply cycle-partitioning to Figure^ instead, then the x = 0 model is 
the pair-level model as always. The x = 1 model is also the pair-level model 
and the x = 2 model is equivalent to the exact closure model. Hence, cycle¬ 
partitioning does not necessarily lead to improved models as x increases and 
it does not always lead to a reduction in system size with respect to the exact 
closure model. The results from the x = 0 pair-level model and the exact 
model applied to Figure[2j: can be seen in the section on size-partitioning below 
(Figure [ 8 } and so are not reproduced here. 

An extreme example of the failure of cycle-partitioning to produce large 
hierarchies of approximate models is given by the triangular lattice shown in 
FigurefT] Here, the x = 0 model is the pair-level model. For x = 1, consider the 
triple A 3 S 1 C 4 (VA, C G {S', /}). Here we do not have cycle-partitioning since 
by deleting node 1, there is a path of length 1 between nodes 3 and 4. As we 
move to order 4 motifs, (e.g. adding a node to the above triple either by the 
edge (1,2) or the edge (3,2)), it is readily seen that there will always exist motifs 
which do not cycle-partition for x = 1 at all orders. Hence for the triangle 
lattice, even for x = 1 cycle-partitioning, we obtain a model with motif states 
at the size of the full network. Some cycle-partitioning does occur however, so 
the resulting model is not exact. For example, for the triple A 2 S 1 C 4 , deleting 
node 1 means that the shortest path from 2 to 4 is via node 3 and is of length 
2. So we have cycle-partitioning here. We also have it for states A7S1C4. This 
state is also cycle-partitioning at x = 2 (the path length from node 7 to node 4 
after deletion of node 1 is 3) but we no longer cycle-partition A 2 S 1 C 4 . Finally, 
at x = 3, no cycle-partitioning occurs anywhere and we have an exact model 
containing subsystem states at the size of the system (Mc( 3 ) = M). 
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Figure 6: Cycle-partitioning applied to the scenario in Figure EJo with x = 0 
which corresponds to the pair-level model through x = 1 and finally x = 2 which 
is exact for this scenario. Here we assume that all individuals are susceptible at 
time t = 0 with probability 5/6 and infected otherwise (the states of individuals 
are initially statistically independent). We have assumed a transmission rate of 
unity across each link and a removal rate of unity. 
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Figure 7: A triangle lattice - an extreme example where cycle-partitioning at 
order greater than x = 0 requires subsystem states which contain all individuals. 


In general, if the largest transmission block in a network has n individuals, 
then any cycle-partitioning model of order x > n — 2 is exact (see Theorem IB. II 
in Appendix B). This is illustrated by the network in Figure [2jo where the largest 
transmission block is of size n = 4 and the x = 2 cycle-partitioning model is 
exact (Figure [6]). This is also the case for the graph in Figure [2J: where n = 5 
and the x = 3 model is exact (the x = 2 model is also happens to be exact 
here as well). Another general result is that if the smallest directed sub-block 
consists of n individuals, then the cycle-partitioning models of order x < n — 2 
are all equivalent to the pair-level (x=0) models (see Theorem lB.2l in Appendix 
B). This is illustrated by the graph in Figure [2J; where the smallest directed 
sub-block is n = 4, and we found that the x = 1 cycle-partitioning model is the 
same as the pair-level model. 


4.2 Size-partitioning 

The issues arising in some networks such as FigureGfc, where even cycle-partitioning 
at x = 2 requires subsystem states containing all individuals, and the extreme 
example of the triangular lattice, motivate an alternative pseudo-partitioning 
approach whereby the sizes of subsystem states are more directly constrained. 

Definition 4.3. 


fs(x) (A") 


1 if |A| = x + 1 
0 otherwise 


(18) 
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where X C V and x £ N. 

Here we make the substitution f p (n , W\k , k) = fs(x)(W\k) into equation[T3l 

Remark. As with previous pseudo-partitioning, a complete approximate model 
arises from the equations for the individual-level states and then repeatedly 
applying this equation to each subsystem state that emerges. As with cycle¬ 
partitioning, the x = 0 size-partitioning model corresponds to the pair-level 
model. 


4.2.1 Examples 

As an example, consider the x = 1 size-partitioning model for Figure [2fc, where 
the cycle-partitioning hierarchy was redundant. Equation 1111 now becomes: 


(SiSah) 


(SiS 2 S 3 )(S 3 I A ) 

(S 3 ) 

{SiSzhXSJs) 

(Si) 


(S 1 S 2 S 3 )(S 3 I 5 ) 

(S 3 ) 

(1 + g)(S 1 S 2 I 3 ). 


(SiS 2 J 3 ><SiJ4> 

(Si) 


For x = 2 size-partitioning, equation [FT] is left untouched since the exact rate 
equation for a subsystem state of size 3 does not involve subsystem states larger 
than 4. However, equation fT2l becomes: 


(S 1 S 2 S 3 I A ) 


(SiS 2 S 3 I A )(SiI 3 ) 

(Si) 


(S 1 S 2 S 3 I A )(S 3 I s ) 

(S 3 ) 


(2 + g)(S 1 S 2 S 3 h). 


In this way, we obtain three different approximate models: x = 0, x = 1 and 
x = 2. For x > 2, the model is exact. Figure [8] shows results from the application 
of each of these three approximate models and the exact x = 3 model to SIR 
dynamics on the network depicted in Figure \2t'- An interesting feature that 
should be noted for the x = 2 model is that it very slightly underestimates the 
rate of spread of the epidemic. Typically, experience shows that the closure of 
these equations leads to over-estimation of the rate of spread, but this provides 
a counter example. 

While size-partitioning will generate a large hierarchy of approximate models 
where cycle partitioning fails to do so (such as for the triangular lattice), it has 
problems of its own. Specifically, we see from Figure [S] that since the smallest 
cycle size in Figure [2}: is 4, the x = 1 size-partitioning model is almost identical 
to the x = 0 pair-level model. The x = 3 and x = 2 models are also almost 
identical. Hence, the extra computation in evaluating at x = 1 and x = 3 is 
wasteful. In this sense, cycle-partitioning has an advantage by only picking up 
complete cycles in the network. 

An additional problem with size-partitioning is that it ignores genuine dy¬ 
namical partitioning. For example, for Figure [2}j, we would require motif sizes 
of 6 (x = 4) to describe this exactly within the size-partitioning scheme. How¬ 
ever, if we permit genuine dynamical partitioning, we only need motif sizes of 
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Figure 8: Size-partitioning applied to the scenario in Figure [2jc with x = 0 
which corresponds to the pair-level model through x = 1, x = 2, and finally 
x = 3 which is exact for this scenario. An individual is assumed to be initially 
susceptible with probability 4/5 and infected otherwise (the states of individuals 
are initially statistically independent). We have assumed a transmission rate of 
unity across each link and a removal rate of unity. 
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less than or equal to 4. This issue is readily resolved by considering the modified 
scheme: 


fE,s(x){X,Y,i) 


1 if f E {X,Y,i) = 1 or f S ( x ){Y) = 1 
0 otherwise 


(19) 


which incorporates genuine dynamical partitioning into size-partitioning. With 
this rule, in Figure 03, the genuine dynamical partitioning around node 1 is 
utilised wherever possible. 


4.3 Hybrid-partitioning 

Both cycle-partitioning and size-partitioning have their merits. Size-partitioning 
avoids unnecessarily large motif states where cycle-partitioning cannot be effec¬ 
tively implemented beyond an early stage, such as in the triangle lattice. On 
the other hand, cycle-partitioning picks out cycles in the network and closes 
at the pair level unless complete cycles can be incorporated, avoiding wasteful 
computation with minimal gain in accuracy. 

We can construct a hybrid-partitioning scheme that captures the benefits 
of both cycle-partitioning and size-partitioning while avoiding the problems of 
both. We define this hybrid-partitioning as: 

Definition 4.4. 


fc(x)S(x)(X,Y,i ) 


1 iffc(x)(X,Y,i) = lorf s{x) (Y) = l 

0 otherwise 


( 20 ) 


This leads to a hierarchy of models defined by substituting f p (n , W\k, k) = 
fc(x)S(x)( n i W \ k,k) into equation fT3l This also has the pair-level model for 
x = 0. We also note that alternative hierarchies could be designed with different 
values of x for the size-partitioning and the cycle-partitioning parts. 

This closure benefits from the advantages of both cycle-partitioning and 
size-partitioning. Firstly, if there are only large cycles, the hierarchy is closed 
at a low order by cycle-partitioning. This is desirable since, as illustrated in 
Figure 0 continuing on generates little benefit unless we are able to continue 
to the size of the smallest cycle. However, if the system is not amenable to 
cycle-partitioning, as in the triangular lattice, then size-partitioning is required. 
A network illustrating the benefits of this is shown in Figure 0 For hybrid¬ 
partitioning with x = 1, let us start with the probability that node 1 is infectious: 

(h) = (Sr/a) + (SHb) + {S x h) + (SJ 7 ) - g(h). (21) 


For the first of these terms on the right-hand-side, the corresponding approxi¬ 
mate differential equation is: 


(Sih) 


(SiS 2 )(S 2 I 3 ) <SiJ 2 )(SiJ 5 ) (5 1 / 2 )(5 1 / 6 ) 


(S 2 ) 

<SiJ 2 ><Si/ 7 > 

(Si) 


(S 1 ) 

-(l+sKSHa) 


(Si) 
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Figure 9: A graph that illustrates the benefits of hybrid-partitioning. Expanding 
from node 1 using x = 1, we utilise both cycle-partitioning and size-partitioning 
capturing the advantages of both. 


where we have employed x = 1 cycle-partitioning. For the term ( S\I§) in 
equation 12II we obtain: 

/c - rx (SrSeXW , /c , c r x (Sih)(Sih) 

( 15) -<Ss>- + {SlS5le) -(£>- 


~(Slhh) ~ 


(Si) 


where, again, x = 1 cycle-partitioning has been implemented where possible. 
For the second term in this expression, we have: 


(SiS 5 I 6 ) 


(SiS 6 S 6 ){S 6 I 7 ) {SiS^iSih) 
(Se) (Si) 

(SiS 5 I 6 )(hS 5 ) <5i5 5 / 6 )<Si/2) 


<S 5 > 


(Si) 


-(2 + g)(SiS 5 I 6 ). 


Here, the closures on the first line are via x = 1 size-partitioning, whereas the 
closures on the second line are via meeting the criteria for both x = 1 size¬ 
partitioning and x = 1 cycle-partitioning. 

So, this hybrid-partitioning obtains the best of both methodologies. Cycle¬ 
partitioning avoids unnecessarily including extra terms in the large cycle 1-2- 
3-4-5-1 which we have seen (Figure [8]) generates minimal extra accuracy. Size¬ 
partitioning forces partitioning where the motif sizes get beyond a specified 
level, here constraining the maximum motif size to be 3. 


4.4 Alternative closure 

Before leaving this section, we include a brief aside on using the alternative clo¬ 
sure defined in equation [15] In this case, we can still apply the cycle, size and 
hybrid methods, but we use equation H6l in place of equation [13] Two examples 
of applying this are illustrated in Figure [TUI Here the shaded regions represent 
the existing subsystem states and the solid lines coming out of these regions 
represents the new infectious node being added on. The dashed lines represent 
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Figure 10: Two simple examples of applying the alternative closure rule as 
encoded by equation]]!)] The shaded region specifies the initial subsystem state, 
and there is a new IS link towards it in accordance with the way in which 
the induced state spaces are built. The dashed lines represent additional links 
between the new node and the original subsystem (these would be ignored by 
the closure rule in equation [T511 


other links between the new infectious nodes and the original subsystems. Sup¬ 
posing that the criteria for pseudo-partitioning is met at this stage (i.e. the 
relevant f p (.) = 1), for Figure flOh we obtain 


(hS 2 I 3 ) 


(hS2)(S 2 I 3 )(IlI 3 ) 

(h)(S 2 )(I 3 ) 


and for Figure [TUb. we obtain 


(IiS 2 S 3 I^I 3 ) 


(hS 2 S 3 h)(hI 5 )(S 2 I 5 )(S 3 I 5 ) 

(h)(S 2 )(S 3 )(I 5 )2 


We note that for cycle-partitioning with x > 0, both closure methods become 
equivalent (equation [16] reduces to equation fill) since the types of additional 
links drawn in Figure [TUI could not be present. 

Notice that when the closure of triples always occurs (e.g. x = 0 cycle¬ 
partitioning or x = 0 size-partitioning), the variant of the pair-level models 
introduced in [8| and [2] is obtained under this closure. This variant is expected 
to be able to handle clustered networks more accurately than the variant con¬ 
sidered in El and m that follows from equation [13] 


5 Discussion 

Recently it has been possible to establish exact and practicable representa¬ 
tions of stochastic epidemic dynamics on finite tree networks HU using closure 
methodologies evaluated at the level of individuals M- Message-passing also 
gives exact representations on trees m and this can be shown, under some 
circumstances with Poisson transmission processes, to be equivalent to moment 
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closure models m ■ Under suitable and very restrictive homogeneity assump¬ 
tions, population-level versions of these closed models (e-g. 0) can also be 
exactly derived on idealised graphs with homogeneous initial conditions [5j. 

Within the individual-level closure construction, it is possible to go beyond 
trees and obtain exact representations of epidemic dynamics on some networks 
with cycles using the idea of dynamical partitioning on the graph m Here we 
defined dynamical partitioning on arbitrary networks and also observed that it 
applies to both Markovian and non-Markovian SIR dynamics. In the Markovian 
case with Poisson transmission and removal processes, we can use dynamical 
partitioning to define exact SIR moment closure models. The extent to which 
these models are computationally viable depends primarily on the underlying 
structure of the network. 

More specifically, starting from the probabilities of the states of individual 
nodes in a given network, we uniquely defined the full set of exact induced 
moment equations by automatically implementing dynamical partitioning where 
applicable. We also defined transmission blocks as a natural decomposition of a 
network for the closure of SIR models. Transmission blocks represent a possible 
extension of the block concept in graph theory into directed networks. Using 
this concept, we proved a theorem stating that the size of the largest subsystem 
state appearing in the set of moment equations is equal to the size of the largest 
transmission block. 

We also investigated hierarchies of approximate moment closure models. In 
the epidemic literature, it is normally the case that moment closure models are 
constructed at the level of pairs, or occasionally for triples or quads mil EJ. 
This is often accompanied with an assertion that higher order models exist. 
However, to our knowledge, these higher order epidemic models have never been 
defined explicitly. This is understandable since these models rapidly become too 
complex to be of real practical relevance, but it does leave open the theoretical 
question of how these models can be defined 0- To address this, we introduced 
‘pseudo-partitioning’ to construct complete hierarchies of approximate closed 
models that are well-defined at all orders. In fact, we defined several hierarchies 
of closed models; one in terms of motif size, one in terms of the size of cycles 
in the network, and a hybrid method taking the best of both of the previous 
methods. Undoubtedly other hierarchies can be defined as well. In addition, 
we investigated two mechanisms of closure - one based on exact dynamical 
partitioning and the other which is more related to the Kirkwood closure. 

The closure based directly around dynamical partitioning has the variant of 
the closure model considered by m as its zeroth order variant (for all of the 
size, cycle and hybrid approaches). The hierarchies based around the alternative 
closure all have the model introduced in 0 and 0 as their zeroth order variant 
(this is designed to handle networks with clustering in a more effective way). We 
also observed that the conditions for cycle-partitioning at orders greater than 
zero mean that both methods of closure become equivalent. 

The hierarchies of models generated some interesting observations concern¬ 
ing the convergence to exactness with order. For example, for size-partitioning, 
the models converge to the exact solution with increasing order, but this conver- 
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gence is not always monotonic (see Figure 0). It is typical for moment closure 
models of SIR epidemics to over-exagger ate the spread of an epidemic, but here 
we observed a counter example (see also [Si where this is discussed as a pos¬ 
sibility). An unanswered question is whether the approximate models always 
increase in accuracy as the order of the hierarchy increases. Intuitively we would 
expect that they do, and this is validated by the examples so far investigated. 
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A Proof of the underpinning results for Theo¬ 
rem 13.31 

Theorem 13.31 follows from Corollarv lA.il and Lemma [A. 11 below. 

Definition A.l. A set W n C V of size \W n \ = n can be ‘generated’ from a set 
W m C V of size \W m \ = m where 2 < m < n if and only if a sequence of sets 
W m ,..., Wi ,..., W n exist where Wj+i = W) U {fc}, where k is a single node in 
V \ Wi, and there exists an arc from k towards some individual j £ Wi which 
is not dynamically partitioning relative to k and Wi \ {j}. 

Remark. The above definition is constructed such that there exists a subsystem 
state A : W —» {S,/} belonging to Me, where \W\ > 2, if and only if W can 
be generated from some connected pair. This follows from the definition of Me 
via equation 0 

Lemma A.l. If a set W can be generated from some connected pair, then there 
exists X D W such that D[X] is a directed sub-block. There also exists some 
node i £ W that is reachable from all other nodes in both D[W] and D[X). 

Proof. The proof follows by induction. Lemma |A2] proves the statement for the 
case \W\ = 3 while Lemma fA~3l establishes the inductive step. □ 

Corollary A.l. If A : W —> {S', /} is a subsystem state belonging to Me, where 
\W\ > 2, then there exists X D W such that D[A”] is a directed sub-block. 

Proof. This follows directly from Lemma I A.l I arid Definition I A. II □ 

Lemma A. 2. If a set W where \ W\ = 3 can be generated from some connected 
pair, then there exists X D W such that D\X] is a directed sub-block, and some 
i £ W is reachable from all others in both £)[W] and D[X\. 

Proof. We focus only on directed links since directed sub-blocks cannot be 
destroyed by making links undirected. With reference to Figure [HI if a set 
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c) 
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W3 



Figure 11: Demonstration for Lemma [A.21 ‘ways’ in which a set W 3 = { i,j,k} 
can be generated from the pair {i, j}, where j is connected towards i. Note that 
W 3 is always a subset of some directed sub-block, and i is reachable from all 
others in both Z)[W 3 ] and the directed sub-block. The dashed arrows represent 
paths which may consist of any number of vertices. 


W 3 = {i,j, k} can be generated from the pair W 2 = {i,j}, with j connected to¬ 
wards i, then there is a link from k to either i or j. Further, from the definition 
of dynamical partitioning and the generating rule, there are two possibilities: 1 ) 
there exists two vertex disjoint paths Pi,P 2 from some individual (which could 
be k ) to both members of W 2 , and where k is the penultimate individual in one 
of these paths (see Figure FTTh and c), or 2) there exists a path P 3 from one 
member of IV-j to the other, and k is the penultimate individual in this path 
(see Figure HTb and d). Note that in all cases depicted in Figure ITT1 W 3 is a 
subset of some directed sub-block in which i is reachable from all others (and i 
is reachable from all others in D[W:i\). 

□ 

Lemma A.3. If the statement made in Lemma \A.l\ is true for the case where 
\W\ = n, then it is also true when \W\ = n + 1. 

Proof. Firstly, note that W n+ 1 , where \W n+ \ \ = n + 1, can be generated from 
some connected pair if and only if it can be generated from some set W n , where 
\W n \ = n, which can itself be generated from some connected pair. Now suppose 
that Lemma lA.ll is true for the case where \W\ = n, and let W n be a set 
of size n that can be generated from some connected pair. Then we have a 
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Figure 12: Demonstration for Lemma [A.31 Here, the single node in A' \ W is 
illustrative of the nodes in this set which must be connected by at least one path 
to node i, and where the underlying graph G[X ] is biconnected. We have placed 
node k outside of X, but k £ A' \ W is also permitted, a) shows k belonging to 
one of two vertex disjoint paths from some node to W and b) shows k as the 
penultimate individual in a path from a node in W to a different node in W. In 
either case, W n U {fc} is seen to always be a subset of some F3 X where D\Y] 
is a directed sub-block in which i is reachable from all others (and i is reachable 
from all others in D[W n U {fc}]). 


set A A W n such that D[X] is a directed sub-block where, without loss of 
generality, i £ W n C X is reachable from all others in both D\W n \ and D\X\. 
With reference to Figure [lU and again focusing only on directed links, if a set 
W n+ 1 = W„ U {k} (k £ W n ) can be generated from W n , then either there exist 
two vertex disjoint paths Pi, P2 from some individual to two different members 
of W n and k is the penultimate individual in one of these paths (Figure IT2kl . 
or there exists a path P3 from one member of W n to a different member of W n 
and k is the penultimate individual in this path (Figure IT2bl. This follows from 
the generating rule and the definition of dynamical partitioning. Note that if 
Pi, P2 exist then D[X U Pl U P2] is a directed sub-block in which i is reachable 
from all others (and i is reachable from all others in D[W n + 1 ]). Similarly, if P3 
exists then D[X U P3] is a directed sub-block in which i is reachable from all 
others (and i is reachable from all others in D\W n+ i ]). □ 

Lemma A. 4. If there exists X C V such that D[X] is a directed sub-block, 
then there exists a subsystem state A: X {5,/} belonging to Me- 

Proof. If D[X] is a directed sub-block in which i £ X is reachable from all 
others, then there exists at least one arc (j, i) in D[X], The lemma then follows 
from lemma [A. 51 below which proves that X can be generated from {i,j}- □ 

Lemma A. 5. Let D[X\ be a directed sub-block and let i £ W C X, where 
\W\ > 2, be reachable from all others in both D[W] and D[X], In this case, 
some set W U {fc}, where k £ X \ W, can be generated from W, and i is 
reachable from all others in D[W U {fc}]. 

Proof. From Figure [12] but with k £ X we note that some set W U {fc}, where 
k £ X\W, can be generated from W if and only if there exist two vertex disjoint 
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Figure 13: Demonstration for Lemma [A. 5 1 shows the ways in which k £ X\W 
can be connected to W. We have cases a) the underlying graph of D[X] is not 
biconnected, b) Existence of path P3, c) Existence of paths PI and P2 and 
d) Existence of a node from which W cannot be reached. Cases a) and d) are 
not directed sub-blocks so the existence of paths PI and P2, or path P3, is 
established. 


paths Pl , P 2 from some individual to two different members of W and where k 
is the penultimate individual in one of these paths, or there exists a path P 3 
from one member of W to a different member of W and k is the penultimate 
individual in this path. Our proof is by contradiction. We shall assume that 
neither of these scenarios hold and show that this contradicts the assumption 
that D[X] is a directed sub-block. 

Every individual in X \ W is at the start of a path to i in D[X\. Figure [13] 
shows the different ways in which an individual k £ X \ W may be connected to 
an individual of W in D[X]. Firstly, the underlying graph in Figure IT3h is not 
biconnected so here D[X) is not a directed sub-block. Secondly, Figures H3b 
and c correspond to the existence of path P3 and the existence of paths PI, P2 
respectively and hence WU{fc} is generated. Finally, Figure fTStl has an individ¬ 
ual from which W cannot be reached and so D[X\ is not a directed sub-block. 
Other more complicated variants of this path will also contain such individuals 
from which W cannot be reached. Hence, if paths PI and P2 do not exist, and 
path P3 does not exist, then D[X] is not a directed sub-block. 

□ 

B Proof of general results on cycle-partitioning 

The main results of this appendix are stated as Theorem IB . 1 1 and Theorem IB. 21 

Lemma B.l. Any induced subsystem state A : W —> {S',/} belonging to M c ^ 
consists of a set of individuals W C V where there is at least one individual 
reachable from all others in D[VF]. 

Proof. Follows from the way in which Mct x ) is constructed via equation [13] (or 
equation [TgI) . □ 
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Theorem B.l. If the largest transmission block in a network consists of n 
individuals, then any cycle-partitioning model of order x > n — 2 is exact. 

Proof. For any W C V where at least one individual is reachable from all others 
in D[W], if any i £ W is cycle-partitioning at order x > n — 2 with respect to 
some j (fcW and W \ i, where (j, i) is an arc, then i is also dynamically parti¬ 
tioning with respect to j and W\*. This follows because if i is not dynamically 
partitioning, but is cycle-partitioning at order x > n — 2, then this implies the 
existence of a directed sub-block containing j , i and at least one other member 
of W, and which consists of more than n individuals. Therefore, by Lemma lB.il 
M C ( X ) only utilises genuine dynamical partitioning and we have Mqm = Me 
for x > n — 2 . 

□ 

Theorem B.2. If the smallest directed sub-block consists of n individuals, then 
all cycle-partitioning models of order x < n — 2 are equivalent to the pair-level 
models. 

Proof. For any connected pair W C V {\W\ = 2), if i G W is not cycle¬ 
partitioning at order x < n — 2 with respect to j £ W and W \ i, where (j, i) 
is an arc, then there exists a directed sub-block containing W U j, and which 
consists of less than n individuals. Therefore, no such j can exist. From the way 
in which Mq/ x \ is constructed, this means that no subsystem states larger than 
connected pairs emerge and we have the pair-level model, i.e. = Mq^ 

for x < n — 2 . 

□ 

Remark. Together, Theorems IB.II and IB. 21 imply that the difference in size be¬ 
tween the largest directed sub-block (or largest transmission block) and smallest 
directed sub-block gives an upper bound on the number of distinct models that 
the cycle-partitioning approach can provide. If all directed sub-blocks are the 
same size then no models that are distinct from the pair-level model and the 
exact dynamical partitioning model emerge. However, even when this difference 
is large the number of distinct models may sometimes be small, as was shown 
to be the case for the triangle lattice (where the difference is IV — 3). 
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